CDS
Accession Number | TCMCG041C25399 |
gbkey | CDS |
Protein Id | XP_010272554.1 |
Location | complement(join(2535366..2535410,2535557..2535720,2555440..2555533,2568447..2568608,2569189..2569351,2570003..2570111,2570278..2570372,2579630..2579765,2579904..2579973,2580060..2580158,2596915..2597063,2609129..2609339,2612971..2613105,2613225..2613297,2613471..2613643,2613808..2614038)) |
Gene | LOC104608305 |
GeneID | 104608305 |
Organism | Nelumbo nucifera |
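The Location field above uses the GenBank-style `complement(join(...))` notation: 16 exon segments on the reverse strand. The following is a minimal sketch, not part of the original record, of how that string could be parsed to check that the summed exon lengths match the protein length reported below (702 aa plus one stop codon). The helper name `parse_location` is hypothetical, and the regex assumes simple `start..end` segments with no partial-boundary markers such as `<` or `>`.

```python
import re

# Location string copied verbatim from the record's Location field.
location = (
    "complement(join(2535366..2535410,2535557..2535720,2555440..2555533,"
    "2568447..2568608,2569189..2569351,2570003..2570111,2570278..2570372,"
    "2579630..2579765,2579904..2579973,2580060..2580158,2596915..2597063,"
    "2609129..2609339,2612971..2613105,2613225..2613297,2613471..2613643,"
    "2613808..2614038))"
)

def parse_location(loc):
    """Return (strand, [(start, end), ...]) for a simple GenBank-style location.

    Hypothetical helper: handles only plain 'start..end' segments,
    optionally wrapped in join(...) and complement(...).
    """
    strand = "-" if loc.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", loc)]
    return strand, segments

strand, segments = parse_location(location)

# Coordinates are 1-based and inclusive, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)

print(strand, len(segments), cds_length)
```

Under these assumptions the total comes to 2109 nt, which equals 3 × (702 + 1) and so agrees with a 702-residue protein followed by a stop codon.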
Protein
Length | 702aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA264089 |
db_source | XM_010274252.2 |
Definition | PREDICTED: DNA mismatch repair protein MLH1 isoform X2 [Nelumbo nucifera] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGACGTAGAAACACCACCTCCATGCCCAAAGGAGCCTCCGAAAATCCACCGCCTCGACCAGTCGGTCGTCAACAGGATTGCCGCCGGGGAGGTCATTCAACGCCCCGTCTCCGCCGTCAAGGAGCTAGTAGAGAATAGCCTCGATGCTTCCGCCACCTCCATCAGTGTCGTTGTTAAGGACGGAGGTCTCAAGTTCATCCAAGTCTCCGATGACGGCCATGGAATCCGGTATGAAGACTTGCCAATACTCTGCGAAAGGCACACCACGTCGAAGCTATCGGCATTCGAGGACTTAGAGACTATAAAATCAATGGGGTTTAGGGGAGAGGCATTGGCCAGCATGACGTATGTTGGCCATGTTACAGTGACGACGATAACAGAAGGCCAGTTGCATGGTTACAGGGTATCTTATAGAGATGGTGCCATGGAGAATGAACCTAAAGCTTGTGCTGCTGTTAAAGGGACTCAGATAATGATTGAGAATCTATTCTTTAACATGAGCGCTAGGAGGAAAACGTTGCAGAATTCTGCAGATGATTATTCAAAGATAGTAGATTTGATAAGCCGGTTTGCAATTCATCACATAAATGTTGGCTTTTCTTGCAGAAAGCATGGTGCTGCCAGACCTGATGTTCACACAGTTGCTACATCCTCAAGAATTGATGCGATCAGATCTGTTTACGGAGTTGTGGTTGCTCGTGACCTATTGAGCATAACTGCTGCAGAGAATGACCCATCTAGACCAGTGTTTGAGATGAATGGGTTTATCTCCAATTCAAATTACAGTGCGAAGAAGATAACTATGGTTCTTTTTATCAATGACAGATTGGTCGAGTGCACTTCTTTGAAGAGGGCCATTGAAGTTGTTTACACTGCAACCTTGCCAAAAGCATCGAAGCCTTTCATTTACATGTCTATTGTGTTGCCGCCTGAGCATGTGGATGTGAACATACACCCAACAAAAAAAGAGGTTAGCCTTCTGTATCAGGAAAGCATCATTGAGAACATACAGGCTGCAGTTGAGTTGAAGTTGAGGAATTCAGATACTGTGAGGACATTCCACACACAGACAACACATCCCTCTACATCTGCTCCTCTTGGTGCAAGGAGGGATAACCAAATTAATTCCTCAGATCCTGTGTCAAAATCCCAGAAAGTCCCTGTGCATAAAGTTGTGCGAACAGATACTCTGGATCCTATGGGAAGATTGCATGCCTACTTGCCTGCCAAGCCTCCTAGACAACAAGGAGGGAATTCTTGCTTAACTGCTGTGCGATGTGCTGTGAGACGAAGAAGGAATCCAAAGGAAAGTGCAGATCTTACTAGCATTCAGGAGCTTCTGAGTGAAATTGATTCTAATTGTCACTCTGGTCTGCTGGACATTGTGAAGCATTGTACATTTATTGGAATGGCAGATGATCTTTTTGCATTACTTCAATACAATACCCACTTATATCTTGTTAATGTGGTGAATTTGAGCAAAGAACTTATGTATCAGCAAGTTCTACGTCGATTTGCCCATTTCAATGCTATACAACTAAGTGAGCCTGCTCCACTACCAGAGCTAATAATGATGGCACTGAAGGAGGATGTAGACCCAGAATGTAGCGAGAATGATGATCTAAAAGAGAAAATTGCTGAAATGAACACTGAACTGCTCAAGCAAAAAGCTGAAATGCTAGATGAATATTTCAGCATTCACATAGATCAAAAAGGGAATTTGTCTAGGCTTCCTGTCATACTTGATCAGTACACACCTGATATGGATCATGTGCCGGAATTTGTATTGTGTTTGGGCAATGATGTGGATTGGGAAGAAGAAAAGAATTGCTTTCGAACAATTTCAGCTGCCTTAGGAAATTTCTATGCTATGCATTCTCCTCTTTTGCCAAACCCTGAGGATAACAGTGCCATGGAAAATGAGATTGATGAGGAATTGATCTCGGAGGCAGCGACTGCATGGGCCCAGCGCGAATGGAACATCCAACATGTA
CTATTCCCGTCAATGAGACTTTTCCTGAAGCCACCTAATTCAATGGCTACAAATGGAACTTTTGTCCAGGTGACTTCAATGGAGAAACTTTATAAGATTTTTGAAAGATGTTAA |
Protein: MDVETPPPCPKEPPKIHRLDQSVVNRIAAGEVIQRPVSAVKELVENSLDASATSISVVVKDGGLKFIQVSDDGHGIRYEDLPILCERHTTSKLSAFEDLETIKSMGFRGEALASMTYVGHVTVTTITEGQLHGYRVSYRDGAMENEPKACAAVKGTQIMIENLFFNMSARRKTLQNSADDYSKIVDLISRFAIHHINVGFSCRKHGAARPDVHTVATSSRIDAIRSVYGVVVARDLLSITAAENDPSRPVFEMNGFISNSNYSAKKITMVLFINDRLVECTSLKRAIEVVYTATLPKASKPFIYMSIVLPPEHVDVNIHPTKKEVSLLYQESIIENIQAAVELKLRNSDTVRTFHTQTTHPSTSAPLGARRDNQINSSDPVSKSQKVPVHKVVRTDTLDPMGRLHAYLPAKPPRQQGGNSCLTAVRCAVRRRRNPKESADLTSIQELLSEIDSNCHSGLLDIVKHCTFIGMADDLFALLQYNTHLYLVNVVNLSKELMYQQVLRRFAHFNAIQLSEPAPLPELIMMALKEDVDPECSENDDLKEKIAEMNTELLKQKAEMLDEYFSIHIDQKGNLSRLPVILDQYTPDMDHVPEFVLCLGNDVDWEEEKNCFRTISAALGNFYAMHSPLLPNPEDNSAMENEIDEELISEAATAWAQREWNIQHVLFPSMRLFLKPPNSMATNGTFVQVTSMEKLYKIFERC |
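As a consistency check between the CDS and Protein entries above, the sketch below translates short excerpts of the CDS with the standard genetic code. It is an illustration only, not the pipeline used to produce this record. The names `CODON_TABLE` and `translate` are hypothetical, and the two excerpts are the first 18 and last 15 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Excerpts copied from the CDS above (both are in frame).
cds_start = "ATGGACGTAGAAACACCA"
cds_end = "TTTGAAAGATGTTAA"

print(translate(cds_start))  # MDVETP
print(translate(cds_end))    # FERC
```

The two results match the first six and last four residues of the Protein entry. Translating the full CDS the same way should give the complete 702-residue sequence, assuming the two CDS lines are concatenated without the line break.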